Overview

Dataset statistics

Number of variables: 49
Number of observations: 416198
Missing cells: 4141442
Missing cells (%): 20.3%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 155.6 MiB
Average record size in memory: 392.0 B
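
The two derived figures in this table can be cross-checked from the raw counts. The sketch below assumes that the missing-cell percentage is taken over all cells (rows × columns) and that 1 MiB is 1024 × 1024 bytes; the report does not state either definition, so treat this as a plausibility check rather than the tool's own formula.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 49
n_observations = 416_198
missing_cells = 4_141_442
total_memory_mib = 155.6

# Assumption: the percentage is taken over every cell (rows x columns).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 MiB = 1024 * 1024 bytes.
avg_record_bytes = total_memory_mib * 1024 * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size in memory: {avg_record_bytes:.1f} B")
```

Under these assumptions both values round to the figures reported above.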

Variable types

CAT: 24
NUM: 15
BOOL: 10

Reproduction

Analysis started: 2020-07-05 20:23:33.277089
Analysis finished: 2020-07-05 20:26:21.939594
Duration: 2 minutes and 48.66 seconds
Version: pandas-profiling v2.8.0
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configuration: config.yaml

Warnings

RD_NO has a high cardinality: 412429 distinct values High cardinality
CRASH_DATE has a high cardinality: 269126 distinct values High cardinality
DATE_POLICE_NOTIFIED has a high cardinality: 318866 distinct values High cardinality
STREET_NAME has a high cardinality: 1549 distinct values High cardinality
LOCATION has a high cardinality: 188585 distinct values High cardinality
LONGITUDE is highly correlated with LATITUDE High correlation
LATITUDE is highly correlated with LONGITUDE High correlation
CRASH_DATE_EST_I has 385362 (92.6%) missing values Missing
LANE_CNT has 217643 (52.3%) missing values Missing
REPORT_TYPE has 9854 (2.4%) missing values Missing
INTERSECTION_RELATED_I has 323233 (77.7%) missing values Missing
NOT_RIGHT_OF_WAY_I has 396746 (95.3%) missing values Missing
HIT_AND_RUN_I has 298508 (71.7%) missing values Missing
PHOTOS_TAKEN_I has 410947 (98.7%) missing values Missing
STATEMENTS_TAKEN_I has 407759 (98.0%) missing values Missing
DOORING_I has 414845 (99.7%) missing values Missing
WORK_ZONE_I has 413389 (99.3%) missing values Missing
WORK_ZONE_TYPE has 413971 (99.5%) missing values Missing
WORKERS_PRESENT_I has 415527 (99.8%) missing values Missing
LANE_CNT is highly skewed (γ1 = 349.9037887) Skewed
LATITUDE is highly skewed (γ1 = -109.5963231) Skewed
LONGITUDE is highly skewed (γ1 = 118.3356283) Skewed
RD_NO is uniformly distributed Uniform
CRASH_DATE is uniformly distributed Uniform
DATE_POLICE_NOTIFIED is uniformly distributed Uniform
CRASH_RECORD_ID has unique values Unique
POSTED_SPEED_LIMIT has 6598 (1.6%) zeros Zeros
LANE_CNT has 8011 (1.9%) zeros Zeros
INJURIES_TOTAL has 362531 (87.1%) zeros Zeros
INJURIES_INCAPACITATING has 407047 (97.8%) zeros Zeros
INJURIES_NON_INCAPACITATING has 384446 (92.4%) zeros Zeros
INJURIES_REPORTED_NOT_EVIDENT has 395564 (95.0%) zeros Zeros
INJURIES_NO_INDICATION has 7217 (1.7%) zeros Zeros
CRASH_HOUR has 7955 (1.9%) zeros Zeros

Variables

CRASH_RECORD_ID
Categorical

UNIQUE

Distinct count416198
Unique (%)100.0%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
ccf9f2e9fb5c877901b61f8067cff682f6fa746fdc595f09fc8bb4b079e5cb3e32875df49e4d9e3553c9822dff8a7199973565bfbdf5404e9acea6439787be58
 
1
2fae16dee0a81f64460798544a5bfa9d977b7b65ea75f42289cea5cf35ac5e51392e47de98d0b82787fa0a7c9e06652d3f44ee567a029f161a132272b1d102e5
 
1
111f93bbb4ac656247638ec37f805704286899d863ae2a75625c4e699e6e894c1a4de58ad13cc1cff135acb734b7e8c8db734a44db1f3a14b0dfa72fcb6a7e49
 
1
1006d04cc120a055579b567edba7e4de622d375fa54344ccdf2e5ecd46bf13c68183f9c28998a770fc56a6e34071ed25802af1dbe842774ebb90cf53ef7b1b74
 
1
b8081464a8348ec2162f25691b7faed02fe150aef5e8bf412fd26ffd7626a8c5a1480ace673d34672b3d08faa6c868ac1df9001b4ad5571d9a2b986e95ef3564
 
1
Other values (416193)
416193
ValueCountFrequency (%) 
ccf9f2e9fb5c877901b61f8067cff682f6fa746fdc595f09fc8bb4b079e5cb3e32875df49e4d9e3553c9822dff8a7199973565bfbdf5404e9acea6439787be581< 0.1%
 
2fae16dee0a81f64460798544a5bfa9d977b7b65ea75f42289cea5cf35ac5e51392e47de98d0b82787fa0a7c9e06652d3f44ee567a029f161a132272b1d102e51< 0.1%
 
111f93bbb4ac656247638ec37f805704286899d863ae2a75625c4e699e6e894c1a4de58ad13cc1cff135acb734b7e8c8db734a44db1f3a14b0dfa72fcb6a7e491< 0.1%
 
1006d04cc120a055579b567edba7e4de622d375fa54344ccdf2e5ecd46bf13c68183f9c28998a770fc56a6e34071ed25802af1dbe842774ebb90cf53ef7b1b741< 0.1%
 
b8081464a8348ec2162f25691b7faed02fe150aef5e8bf412fd26ffd7626a8c5a1480ace673d34672b3d08faa6c868ac1df9001b4ad5571d9a2b986e95ef35641< 0.1%
 
af746c3cb9c6564ea3aad4b1aab172bd6aae4f2e5979e132b9f1bc212f388dd5e7f9c2bdfe6f93a659ca8836202fd4b7ff0afa25c7b45ae582d00dc94a72f09a1< 0.1%
 
a4e02d67c4b7b8feac99c048d74ec89b25b9faf8072b6d9a3ef7dacceae7063cce3b3248aec8a1fda2971016e9b33bb2da6b9202a9349d3e9f4e04f342fc6f101< 0.1%
 
32a307bebb1a65450fbfeee9b5215b558c671a20c678575e1f1ce7c5b9a7659dce88d4e48aa93e4a8619499915a990ee2505511e63665c10916343002c3228331< 0.1%
 
1b14476dbd80f938ac436673297ab6a6f4d46f892e03fb102a817c448bedfdc561b6f8687910f103ae3fc2aa3c0d197261061ce89d83ced8099c654a5a10345d1< 0.1%
 
05f1d2a58eb173c714b4e09a4701940e7ca732359f8359b54880150540eec5b66d28247bf0c48739a209749b9a73e7ad17c260e31285abda6ad8adbdba25d5d11< 0.1%
 
Other values (416188)416188> 99.9%
 

Length

Max length128
Median length128
Mean length128
Min length128

RD_NO
Categorical

HIGH CARDINALITY
UNIFORM

Distinct count412429
Unique (%)100.0%
Missing3769
Missing (%)0.9%
Memory size3.2 MiB
JB565226
 
1
JC345846
 
1
JC333602
 
1
JC340011
 
1
HZ279027
 
1
Other values (412424)
412424
ValueCountFrequency (%) 
JB5652261< 0.1%
 
JC3458461< 0.1%
 
JC3336021< 0.1%
 
JC3400111< 0.1%
 
HZ2790271< 0.1%
 
JA2014641< 0.1%
 
JC5017071< 0.1%
 
HY5101371< 0.1%
 
JB4120061< 0.1%
 
JB2408151< 0.1%
 
Other values (412419)41241999.1%
 
(Missing)37690.9%
 

Length

Max length8
Median length8
Mean length7.95472107
Min length3

CRASH_DATE_EST_I
Boolean

MISSING

Distinct count2
Unique (%)< 0.1%
Missing385362
Missing (%)92.6%
Memory size3.2 MiB
Y
 
26728
N
 
4108
(Missing)
385362
ValueCountFrequency (%) 
Y267286.4%
 
N41081.0%
 
(Missing)38536292.6%
 

CRASH_DATE
Categorical

HIGH CARDINALITY
UNIFORM

Distinct count269126
Unique (%)64.7%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
11/10/2017 10:30:00 AM
 
27
11/10/2017 10:00:00 AM
 
20
01/12/2019 02:30:00 PM
 
20
01/12/2019 03:00:00 PM
 
18
01/12/2019 02:00:00 PM
 
18
Other values (269121)
416095
ValueCountFrequency (%) 
11/10/2017 10:30:00 AM27< 0.1%
 
11/10/2017 10:00:00 AM20< 0.1%
 
01/12/2019 02:30:00 PM20< 0.1%
 
01/12/2019 03:00:00 PM18< 0.1%
 
01/12/2019 02:00:00 PM18< 0.1%
 
02/26/2020 07:45:00 AM17< 0.1%
 
02/26/2020 08:00:00 AM16< 0.1%
 
09/04/2018 08:00:00 AM16< 0.1%
 
01/12/2019 04:00:00 PM16< 0.1%
 
01/25/2018 08:00:00 AM15< 0.1%
 
Other values (269116)416015> 99.9%
 

Length

Max length22
Median length22
Mean length22
Min length22
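
CRASH_DATE is profiled as a categorical string, which is why it triggers the high-cardinality warning. A minimal sketch of parsing it into a timestamp follows; the format string is inferred from the sample values listed above (month/day/year with a 12-hour clock) and is an assumption, and `parse_crash_date` is a hypothetical helper name.

```python
from datetime import datetime

# Assumption: format inferred from samples such as "02/26/2020 07:45:00 AM"
# (month/day/year, 12-hour clock with AM/PM).
CRASH_DATE_FORMAT = "%m/%d/%Y %I:%M:%S %p"


def parse_crash_date(text):
    """Parse a CRASH_DATE string into a datetime (hypothetical helper)."""
    return datetime.strptime(text, CRASH_DATE_FORMAT)


samples = ["11/10/2017 10:30:00 AM", "01/12/2019 02:30:00 PM"]
parsed = [parse_crash_date(s) for s in samples]

for original, ts in zip(samples, parsed):
    # Hour and month can be derived once the value is a real timestamp.
    print(original, "->", ts.isoformat(), "hour:", ts.hour, "month:", ts.month)
```

Every sample is 22 characters long, which matches the constant length reported for this column.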

POSTED_SPEED_LIMIT
Real number (ℝ≥0)

ZEROS

Distinct count: 42
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 28.262797995184982
Minimum: 0
Maximum: 99
Zeros: 6598
Zeros (%): 1.6%
Memory size: 3.2 MiB

Quantile statistics

Minimum: 0
5-th percentile: 15
Q1: 30
median: 30
Q3: 30
95-th percentile: 35
Maximum: 99
Range: 99
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 6.591077534
Coefficient of variation (CV): 0.2332068302
Kurtosis: 8.500474378
Mean: 28.262798
Median Absolute Deviation (MAD): 0
Skewness: -1.918221926
Sum: 11762920
Variance: 43.44230306
Histogram with fixed size bins (bins=10)
Common values

Value               Count     Frequency (%)
30                  306912    73.7%
35                  28469     6.8%
25                  24953     6.0%
20                  16016     3.8%
15                  14243     3.4%
10                  8428      2.0%
0                   6598      1.6%
40                  3739      0.9%
5                   3434      0.8%
45                  2476      0.6%
Other values (32)   930       0.2%

Minimum 5 values

Value               Count     Frequency (%)
0                   6598      1.6%
1                   35        < 0.1%
2                   19        < 0.1%
3                   104       < 0.1%
4                   1         < 0.1%

Maximum 5 values

Value               Count     Frequency (%)
99                  67        < 0.1%
70                  3         < 0.1%
65                  9         < 0.1%
63                  1         < 0.1%
60                  26        < 0.1%
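
The descriptive statistics for POSTED_SPEED_LIMIT are related by simple identities, which makes them easy to sanity-check. The sketch below uses only the figures reported in this section and assumes the usual definitions (CV = standard deviation / mean, variance = standard deviation squared, sum = mean × number of non-missing rows); it is a consistency check, not the profiler's own code.

```python
# Figures copied from the POSTED_SPEED_LIMIT section.
n_rows = 416_198            # no missing values in this column
mean = 28.262797995184982
std_dev = 6.591077534

# Assumed textbook definitions of the derived statistics.
cv = std_dev / mean          # coefficient of variation
variance = std_dev ** 2
total = mean * n_rows        # should reproduce the reported Sum

print(f"CV       = {cv:.10f}")
print(f"Variance = {variance:.8f}")
print(f"Sum      = {total:.0f}")
```

The results agree with the reported CV, Variance and Sum up to rounding of the inputs.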
 
Distinct count19
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
NO CONTROLS
239527
TRAFFIC SIGNAL
116124
STOP SIGN/FLASHER
 
40941
UNKNOWN
 
13406
OTHER
 
2497
Other values (14)
 
3703
ValueCountFrequency (%) 
NO CONTROLS23952757.6%
 
TRAFFIC SIGNAL11612427.9%
 
STOP SIGN/FLASHER409419.8%
 
UNKNOWN134063.2%
 
OTHER24970.6%
 
LANE USE MARKING12240.3%
 
YIELD5950.1%
 
OTHER REG. SIGN4000.1%
 
OTHER WARNING SIGN3690.1%
 
RAILROAD CROSSING GATE2860.1%
 
Other values (9)8290.2%
 

Length

Max length24
Median length11
Mean length12.29793271
Min length5

DEVICE_CONDITION
Categorical

Distinct count8
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
NO CONTROLS
241838
FUNCTIONING PROPERLY
144596
UNKNOWN
 
22621
OTHER
 
3213
FUNCTIONING IMPROPERLY
 
2333
Other values (3)
 
1597
ValueCountFrequency (%) 
NO CONTROLS24183858.1%
 
FUNCTIONING PROPERLY14459634.7%
 
UNKNOWN226215.4%
 
OTHER32130.8%
 
FUNCTIONING IMPROPERLY23330.6%
 
NOT FUNCTIONING13500.3%
 
WORN REFLECTIVE MATERIAL192< 0.1%
 
MISSING55< 0.1%
 

Length

Max length24
Median length11
Mean length13.94316888
Min length5
Distinct count12
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
CLEAR
328222
RAIN
 
38322
UNKNOWN
 
17924
SNOW
 
15638
CLOUDY/OVERCAST
 
12782
Other values (7)
 
3310
ValueCountFrequency (%) 
CLEAR32822278.9%
 
RAIN383229.2%
 
UNKNOWN179244.3%
 
SNOW156383.8%
 
CLOUDY/OVERCAST127823.1%
 
OTHER13370.3%
 
FOG/SMOKE/HAZE7820.2%
 
SLEET/HAIL6550.2%
 
FREEZING RAIN/DRIZZLE3840.1%
 
SEVERE CROSS WIND GATE88< 0.1%
 
Other values (2)64< 0.1%
 

Length

Max length24
Median length5
Mean length5.307836655
Min length4
Distinct count6
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
DAYLIGHT
272789
DARKNESS, LIGHTED ROAD
87268
DARKNESS
 
21134
UNKNOWN
 
14949
DUSK
 
12780
ValueCountFrequency (%) 
DAYLIGHT27278965.5%
 
DARKNESS, LIGHTED ROAD8726821.0%
 
DARKNESS211345.1%
 
UNKNOWN149493.6%
 
DUSK127803.1%
 
DAWN72781.7%
 

Length

Max length22
Median length8
Mean length10.70681503
Min length4

FIRST_CRASH_TYPE
Categorical

Distinct count18
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
REAR END
101600
PARKED MOTOR VEHICLE
93339
SIDESWIPE SAME DIRECTION
66394
TURNING
58260
ANGLE
43642
Other values (13)
52963
ValueCountFrequency (%) 
REAR END10160024.4%
 
PARKED MOTOR VEHICLE9333922.4%
 
SIDESWIPE SAME DIRECTION6639416.0%
 
TURNING5826014.0%
 
ANGLE4364210.5%
 
FIXED OBJECT183044.4%
 
PEDESTRIAN98612.4%
 
SIDESWIPE OPPOSITE DIRECTION61421.5%
 
PEDALCYCLIST59741.4%
 
OTHER OBJECT38820.9%
 
Other values (8)88002.1%
 

Length

Max length28
Median length10
Mean length13.46485807
Min length5

TRAFFICWAY_TYPE
Categorical

Distinct count20
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
NOT DIVIDED
187541
DIVIDED - W/MEDIAN (NOT RAISED)
75538
ONE-WAY
55263
PARKING LOT
 
29929
DIVIDED - W/MEDIAN BARRIER
 
25175
Other values (15)
42752
ValueCountFrequency (%) 
NOT DIVIDED18754145.1%
 
DIVIDED - W/MEDIAN (NOT RAISED)7553818.1%
 
ONE-WAY5526313.3%
 
PARKING LOT299297.2%
 
DIVIDED - W/MEDIAN BARRIER251756.0%
 
OTHER121032.9%
 
FOUR WAY93842.3%
 
ALLEY66841.6%
 
UNKNOWN45561.1%
 
CENTER TURN LANE36960.9%
 
Other values (10)63291.5%
 

Length

Max length31
Median length11
Mean length14.67856645
Min length4

LANE_CNT
Real number (ℝ≥0)

MISSING
SKEWED
ZEROS

Distinct count: 41
Unique (%): < 0.1%
Missing: 217643
Missing (%): 52.3%
Infinite: 0
Infinite (%): 0.0%
Mean: 13.35549847649266
Minimum: 0.0
Maximum: 1191625.0
Zeros: 8011
Zeros (%): 1.9%
Memory size: 3.2 MiB

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 2
median: 2
Q3: 4
95-th percentile: 4
Maximum: 1191625
Range: 1191625
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 2964.985076
Coefficient of variation (CV): 222.004823
Kurtosis: 134371.3452
Mean: 13.35549848
Median Absolute Deviation (MAD): 1
Skewness: 349.9037887
Sum: 2651801
Variance: 8791136.5
Histogram with fixed size bins (bins=10)
Common values

Value               Count     Frequency (%)
2                   90913     21.8%
4                   49492     11.9%
1                   32500     7.8%
3                   8644      2.1%
0                   8011      1.9%
6                   4494      1.1%
5                   1935      0.5%
8                   1906      0.5%
7                   184       < 0.1%
10                  161       < 0.1%
Other values (31)   315       0.1%
(Missing)           217643    52.3%

Minimum 5 values

Value               Count     Frequency (%)
0                   8011      1.9%
1                   32500     7.8%
2                   90913     21.8%
3                   8644      2.1%
4                   49492     11.9%

Maximum 5 values

Value               Count     Frequency (%)
1191625             1         < 0.1%
433634              1         < 0.1%
299679              1         < 0.1%
218474              1         < 0.1%
902                 1         < 0.1%
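
The extreme skewness of LANE_CNT is driven by a handful of implausible values such as 1191625. One common way to flag such values is Tukey's 1.5 × IQR rule, sketched below with the quartiles reported in this section. The rule is not part of the report; it is swapped in here purely as an illustration, and the multiplier 1.5 is a conventional choice, not a value taken from the source.

```python
# Quartiles copied from the LANE_CNT quantile statistics.
q1, q3 = 2, 4
iqr = q3 - q1

# Tukey fences (assumption: conventional multiplier of 1.5).
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

# The five largest values listed for LANE_CNT.
largest_values = [1191625, 433634, 299679, 218474, 902]
flagged = [v for v in largest_values if v > upper_fence]

print("fences:", lower_fence, upper_fence)
print("flagged:", flagged)
```

Note that this rule would also flag lane counts of 8 and 10, which appear more than two thousand times in total and may well be valid, so the fence is a starting point for review rather than a cleaning rule.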
 

ALIGNMENT
Categorical

Distinct count6
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
STRAIGHT AND LEVEL
405811
STRAIGHT ON GRADE
 
5096
CURVE, LEVEL
 
3126
STRAIGHT ON HILLCREST
 
1343
CURVE ON GRADE
 
615
ValueCountFrequency (%) 
STRAIGHT AND LEVEL40581197.5%
 
STRAIGHT ON GRADE50961.2%
 
CURVE, LEVEL31260.8%
 
STRAIGHT ON HILLCREST13430.3%
 
CURVE ON GRADE6150.1%
 
CURVE ON HILLCREST207< 0.1%
 

Length

Max length21
Median length18
Mean length17.94646058
Min length12
Distinct count7
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
DRY
310714
WET
 
58746
UNKNOWN
 
27285
SNOW OR SLUSH
 
14891
ICE
 
3393
Other values (2)
 
1169
ValueCountFrequency (%) 
DRY31071474.7%
 
WET5874614.1%
 
UNKNOWN272856.6%
 
SNOW OR SLUSH148913.6%
 
ICE33930.8%
 
OTHER9710.2%
 
SAND, MUD, DIRT198< 0.1%
 

Length

Max length15
Median length3
Mean length3.630392265
Min length3

ROAD_DEFECT
Categorical

Distinct count7
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
NO DEFECTS
346941
UNKNOWN
 
59680
RUT, HOLES
 
4187
OTHER
 
2395
WORN SURFACE
 
1709
Other values (2)
 
1286
ValueCountFrequency (%) 
NO DEFECTS34694183.4%
 
UNKNOWN5968014.3%
 
RUT, HOLES41871.0%
 
OTHER23950.6%
 
WORN SURFACE17090.4%
 
SHOULDER DEFECT9100.2%
 
DEBRIS ON ROADWAY3760.1%
 

Length

Max length17
Median length10
Mean length9.566516418
Min length5

REPORT_TYPE
Categorical

MISSING

Distinct count2
Unique (%)< 0.1%
Missing9854
Missing (%)2.4%
Memory size3.2 MiB
NOT ON SCENE (DESK REPORT)
247133
ON SCENE
159211
ValueCountFrequency (%) 
NOT ON SCENE (DESK REPORT)24713359.4%
 
ON SCENE15921138.3%
 
(Missing)98542.4%
 

Length

Max length26
Median length26
Mean length18.5697865
Min length3

CRASH_TYPE
Categorical

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
NO INJURY / DRIVE AWAY
318202
INJURY AND / OR TOW DUE TO CRASH
97996
ValueCountFrequency (%) 
NO INJURY / DRIVE AWAY31820276.5%
 
INJURY AND / OR TOW DUE TO CRASH9799623.5%
 

Length

Max length32
Median length22
Mean length24.3545524
Min length22

INTERSECTION_RELATED_I
Boolean

MISSING

Distinct count2
Unique (%)< 0.1%
Missing323233
Missing (%)77.7%
Memory size3.2 MiB
Y
88558
N
 
4407
(Missing)
323233
ValueCountFrequency (%) 
Y8855821.3%
 
N44071.1%
 
(Missing)32323377.7%
 

NOT_RIGHT_OF_WAY_I
Boolean

MISSING

Distinct count2
Unique (%)< 0.1%
Missing396746
Missing (%)95.3%
Memory size3.2 MiB
Y
 
17769
N
 
1683
(Missing)
396746
ValueCountFrequency (%) 
Y177694.3%
 
N16830.4%
 
(Missing)39674695.3%
 

HIT_AND_RUN_I
Boolean

MISSING

Distinct count2
Unique (%)< 0.1%
Missing298508
Missing (%)71.7%
Memory size3.2 MiB
Y
112568
N
 
5122
(Missing)
298508
ValueCountFrequency (%) 
Y11256827.0%
 
N51221.2%
 
(Missing)29850871.7%
 

DAMAGE
Categorical

Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
OVER $1,500
236574
$501 - $1,500
124480
$500 OR LESS
55144
ValueCountFrequency (%) 
OVER $1,50023657456.8%
 
$501 - $1,50012448029.9%
 
$500 OR LESS5514413.2%
 

Length

Max length13
Median length11
Mean length11.73067146
Min length11

DATE_POLICE_NOTIFIED
Categorical

HIGH CARDINALITY
UNIFORM

Distinct count318866
Unique (%)76.6%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
02/26/2020 08:30:00 AM
 
12
09/13/2019 05:00:00 PM
 
11
06/30/2018 09:30:00 PM
 
11
01/12/2019 04:30:00 PM
 
11
02/14/2020 05:00:00 PM
 
11
Other values (318861)
416142
ValueCountFrequency (%) 
02/26/2020 08:30:00 AM12< 0.1%
 
09/13/2019 05:00:00 PM11< 0.1%
 
06/30/2018 09:30:00 PM11< 0.1%
 
01/12/2019 04:30:00 PM11< 0.1%
 
02/14/2020 05:00:00 PM11< 0.1%
 
05/14/2019 07:00:00 PM10< 0.1%
 
10/25/2017 04:30:00 PM10< 0.1%
 
05/31/2019 04:00:00 PM10< 0.1%
 
05/25/2018 06:00:00 PM10< 0.1%
 
06/03/2019 06:00:00 PM10< 0.1%
 
Other values (318856)416092> 99.9%
 

Length

Max length22
Median length22
Mean length22
Min length22
Distinct count40
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
UNABLE TO DETERMINE
151188
FAILING TO YIELD RIGHT-OF-WAY
46627
FOLLOWING TOO CLOSELY
45719
NOT APPLICABLE
 
22450
IMPROPER OVERTAKING/PASSING
 
20072
Other values (35)
130142
ValueCountFrequency (%) 
UNABLE TO DETERMINE15118836.3%
 
FAILING TO YIELD RIGHT-OF-WAY4662711.2%
 
FOLLOWING TOO CLOSELY4571911.0%
 
NOT APPLICABLE224505.4%
 
IMPROPER OVERTAKING/PASSING200724.8%
 
IMPROPER BACKING187244.5%
 
FAILING TO REDUCE SPEED TO AVOID CRASH176094.2%
 
IMPROPER LANE USAGE166824.0%
 
IMPROPER TURNING/NO SIGNAL140333.4%
 
DRIVING SKILLS/KNOWLEDGE/EXPERIENCE129413.1%
 
Other values (30)5015312.1%
 

Length

Max length80
Median length19
Mean length23.72254552
Min length6
Distinct count40
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
NOT APPLICABLE
167090
UNABLE TO DETERMINE
147957
FAILING TO REDUCE SPEED TO AVOID CRASH
 
17536
DRIVING SKILLS/KNOWLEDGE/EXPERIENCE
 
13368
FAILING TO YIELD RIGHT-OF-WAY
 
12833
Other values (35)
57414
ValueCountFrequency (%) 
NOT APPLICABLE16709040.1%
 
UNABLE TO DETERMINE14795735.5%
 
FAILING TO REDUCE SPEED TO AVOID CRASH175364.2%
 
DRIVING SKILLS/KNOWLEDGE/EXPERIENCE133683.2%
 
FAILING TO YIELD RIGHT-OF-WAY128333.1%
 
FOLLOWING TOO CLOSELY118462.8%
 
IMPROPER OVERTAKING/PASSING62431.5%
 
IMPROPER LANE USAGE61741.5%
 
WEATHER53381.3%
 
IMPROPER TURNING/NO SIGNAL42181.0%
 
Other values (30)235955.7%
 

Length

Max length80
Median length19
Mean length19.71721152
Min length6

STREET_NO
Real number (ℝ≥0)

Distinct count10858
Unique (%)2.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3602.1317882354074
Minimum0
Maximum451100
Zeros2
Zeros (%)< 0.1%
Memory size3.2 MiB

Quantile statistics

Minimum0
5-th percentile133
Q11200
median3119
Q35504
95-th percentile8755
Maximum451100
Range451100
Interquartile range (IQR)4304

Descriptive statistics

Standard deviation2906.652766
Coefficient of variation (CV)0.8069257141
Kurtosis1349.602701
Mean3602.131788
Median Absolute Deviation (MAD)2118
Skewness9.468342465
Sum1499200046
Variance8448630.299
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
10028240.7%
 
160026140.6%
 
20025340.6%
 
80025020.6%
 
30023450.6%
 
50022490.5%
 
630021070.5%
 
60020960.5%
 
120020600.5%
 
470020290.5%
 
Other values (10848)39283894.4%
 
ValueCountFrequency (%) 
02< 0.1%
 
117480.4%
 
28700.2%
 
32780.1%
 
480< 0.1%
 
ValueCountFrequency (%) 
4511001< 0.1%
 
344531< 0.1%
 
137993< 0.1%
 
137801< 0.1%
 
1377013< 0.1%
 

STREET_DIRECTION
Categorical

Distinct count4
Unique (%)< 0.1%
Missing2
Missing (%)< 0.1%
Memory size3.2 MiB
W
148717
S
136207
N
102558
E
 
28714
ValueCountFrequency (%) 
W14871735.7%
 
S13620732.7%
 
N10255824.6%
 
E287146.9%
 
(Missing)2< 0.1%
 

Length

Max length3
Median length1
Mean length1.000009611
Min length1

STREET_NAME
Categorical

HIGH CARDINALITY

Distinct count1549
Unique (%)0.4%
Missing1
Missing (%)< 0.1%
Memory size3.2 MiB
WESTERN AVE
 
11350
PULASKI RD
 
9890
CICERO AVE
 
9035
ASHLAND AVE
 
8938
HALSTED ST
 
7898
Other values (1544)
369086
ValueCountFrequency (%) 
WESTERN AVE113502.7%
 
PULASKI RD98902.4%
 
CICERO AVE90352.2%
 
ASHLAND AVE89382.1%
 
HALSTED ST78981.9%
 
KEDZIE AVE70111.7%
 
MICHIGAN AVE57641.4%
 
STATE ST51151.2%
 
NORTH AVE49481.2%
 
CLARK ST49331.2%
 
Other values (1539)34131582.0%
 

Length

Max length31
Median length10
Mean length10.68028438
Min length3

BEAT_OF_OCCURRENCE
Real number (ℝ≥0)

Distinct count275
Unique (%)0.1%
Missing4
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean1246.7594775513342
Minimum111.0
Maximum6100.0
Zeros0
Zeros (%)0.0%
Memory size3.2 MiB

Quantile statistics

Minimum111
5-th percentile124
Q1714
median1214
Q31824
95-th percentile2512
Maximum6100
Range5989
Interquartile range (IQR)1110

Descriptive statistics

Standard deviation709.4833482
Coefficient of variation (CV)0.5690619249
Kurtosis-1.018880184
Mean1246.759478
Median Absolute Deviation (MAD)590
Skewness0.1389000038
Sum518893814
Variance503366.6213
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
183458991.4%
 
12247281.1%
 
11447091.1%
 
183146881.1%
 
81341851.0%
 
81539460.9%
 
83333570.8%
 
241332130.8%
 
123231630.8%
 
83430960.7%
 
Other values (265)37521090.2%
 
ValueCountFrequency (%) 
11122160.5%
 
11216260.4%
 
11311260.3%
 
11447091.1%
 
12124530.6%
 
ValueCountFrequency (%) 
61001< 0.1%
 
253512900.3%
 
253417020.4%
 
253327300.7%
 
253210800.3%
 

PHOTOS_TAKEN_I
Boolean

MISSING

Distinct count2
Unique (%)< 0.1%
Missing410947
Missing (%)98.7%
Memory size3.2 MiB
Y
 
4085
N
 
1166
(Missing)
410947
ValueCountFrequency (%) 
Y40851.0%
 
N11660.3%
 
(Missing)41094798.7%
 

STATEMENTS_TAKEN_I
Boolean

MISSING

Distinct count2
Unique (%)< 0.1%
Missing407759
Missing (%)98.0%
Memory size3.2 MiB
Y
 
6850
N
 
1589
(Missing)
407759
ValueCountFrequency (%) 
Y68501.6%
 
N15890.4%
 
(Missing)40775998.0%
 

DOORING_I
Boolean

MISSING

Distinct count2
Unique (%)0.1%
Missing414845
Missing (%)99.7%
Memory size3.2 MiB
Y
 
926
N
 
427
(Missing)
414845
ValueCountFrequency (%) 
Y9260.2%
 
N4270.1%
 
(Missing)41484599.7%
 

WORK_ZONE_I
Boolean

MISSING

Distinct count2
Unique (%)0.1%
Missing413389
Missing (%)99.3%
Memory size3.2 MiB
Y
 
2227
N
 
582
(Missing)
413389
ValueCountFrequency (%) 
Y22270.5%
 
N5820.1%
 
(Missing)41338999.3%
 

WORK_ZONE_TYPE
Categorical

MISSING

Distinct count4
Unique (%)0.2%
Missing413971
Missing (%)99.5%
Memory size3.2 MiB
CONSTRUCTION
1583
UNKNOWN
 
289
MAINTENANCE
 
218
UTILITY
 
137
ValueCountFrequency (%) 
CONSTRUCTION15830.4%
 
UNKNOWN2890.1%
 
MAINTENANCE2180.1%
 
UTILITY137< 0.1%
 
(Missing)41397199.5%
 

Length

Max length12
Median length3
Mean length3.042515822
Min length3

WORKERS_PRESENT_I
Boolean

MISSING

Distinct count2
Unique (%)0.3%
Missing415527
Missing (%)99.8%
Memory size3.2 MiB
Y
 
605
N
 
66
(Missing)
415527
ValueCountFrequency (%) 
Y6050.1%
 
N66< 0.1%
 
(Missing)41552799.8%
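
The "Unique (%)" figures for the sparsely populated columns look surprisingly high until they are read as a share of non-missing rows. The sketch below tests that reading against several columns from this report; the formula is an inference from the published numbers, not something the report states, and `unique_pct` is a hypothetical helper name.

```python
N_OBSERVATIONS = 416_198


def unique_pct(distinct, missing, n=N_OBSERVATIONS):
    """Distinct values as a percentage of non-missing rows (assumed formula)."""
    return 100 * distinct / (n - missing)


# column: (distinct count, missing count, Unique (%) as reported)
reported = {
    "RD_NO": (412_429, 3_769, 100.0),
    "CRASH_DATE": (269_126, 0, 64.7),
    "DOORING_I": (2, 414_845, 0.1),
    "WORK_ZONE_TYPE": (4, 413_971, 0.2),
    "WORKERS_PRESENT_I": (2, 415_527, 0.3),
}

results = {}
for column, (distinct, missing, expected) in reported.items():
    results[column] = round(unique_pct(distinct, missing), 1)
    print(f"{column}: computed {results[column]}%, reported {expected}%")
```

For these columns the computed values match the reported ones after rounding to one decimal place.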
 

NUM_UNITS
Real number (ℝ≥0)

Distinct count15
Unique (%)< 0.1%
Missing1772
Missing (%)0.4%
Infinite0
Infinite (%)0.0%
Mean2.021509268240892
Minimum1.0
Maximum18.0
Zeros0
Zeros (%)0.0%
Memory size3.2 MiB

Quantile statistics

Minimum1
5-th percentile1
Q12
median2
Q32
95-th percentile3
Maximum18
Range17
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.4252291736
Coefficient of variation (CV)0.2103523245
Kurtosis38.4143551
Mean2.021509268
Median Absolute Deviation (MAD)0
Skewness3.135685352
Sum837766
Variance0.1808198501
Histogram with fixed size bins (bins=10)
Common values

Value               Count     Frequency (%)
2                   365967    87.9%
1                   23147     5.6%
3                   20475     4.9%
4                   3600      0.9%
5                   851       0.2%
6                   233       0.1%
7                   92        < 0.1%
8                   29        < 0.1%
9                   15        < 0.1%
10                  9         < 0.1%
Other values (5)    8         < 0.1%
(Missing)           1772      0.4%

Minimum 5 values

Value               Count     Frequency (%)
1                   23147     5.6%
2                   365967    87.9%
3                   20475     4.9%
4                   3600      0.9%
5                   851       0.2%

Maximum 5 values

Value               Count     Frequency (%)
18                  1         < 0.1%
16                  1         < 0.1%
15                  1         < 0.1%
12                  2         < 0.1%
11                  3         < 0.1%
 
Distinct count5
Unique (%)< 0.1%
Missing2669
Missing (%)0.6%
Memory size3.2 MiB
NO INDICATION OF INJURY
362524
NONINCAPACITATING INJURY
 
27954
REPORTED, NOT EVIDENT
 
16255
INCAPACITATING INJURY
 
6439
FATAL
 
357
ValueCountFrequency (%) 
NO INDICATION OF INJURY36252487.1%
 
NONINCAPACITATING INJURY279546.7%
 
REPORTED, NOT EVIDENT162553.9%
 
INCAPACITATING INJURY64391.5%
 
FATAL3570.1%
 
(Missing)26690.6%
 

Length

Max length24
Median length23
Mean length22.81441525
Min length3

INJURIES_TOTAL
Real number (ℝ≥0)

ZEROS

Distinct count18
Unique (%)< 0.1%
Missing2662
Missing (%)0.6%
Infinite0
Infinite (%)0.0%
Mean0.16713659753927107
Minimum0.0
Maximum21.0
Zeros362531
Zeros (%)87.1%
Memory size3.2 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum21
Range21
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.5297513676
Coefficient of variation (CV)3.169571329
Kurtosis56.39700708
Mean0.1671365975
Median Absolute Deviation (MAD)0
Skewness5.267008107
Sum69117
Variance0.2806365115
Histogram with fixed size bins (bins=10)
Common values

Value               Count     Frequency (%)
0                   362531    87.1%
1                   39134     9.4%
2                   8066      1.9%
3                   2403      0.6%
4                   864       0.2%
5                   317       0.1%
6                   122       < 0.1%
7                   47        < 0.1%
9                   15        < 0.1%
8                   15        < 0.1%
Other values (8)    22        < 0.1%
(Missing)           2662      0.6%

Minimum 5 values

Value               Count     Frequency (%)
0                   362531    87.1%
1                   39134     9.4%
2                   8066      1.9%
3                   2403      0.6%
4                   864       0.2%

Maximum 5 values

Value               Count     Frequency (%)
21                  2         < 0.1%
19                  1         < 0.1%
16                  1         < 0.1%
15                  3         < 0.1%
13                  2         < 0.1%
 

INJURIES_FATAL
Categorical

Distinct count4
Unique (%)< 0.1%
Missing2662
Missing (%)0.6%
Memory size3.2 MiB
0
413179
1
 
333
2
 
20
3
 
4
Value               Count     Frequency (%)
0                   413179    99.3%
1                   333       0.1%
2                   20        < 0.1%
3                   4         < 0.1%
(Missing)           2662      0.6%
 

Length

Max length3
Median length3
Mean length3
Min length3

INJURIES_INCAPACITATING
Real number (ℝ≥0)

ZEROS

Distinct count8
Unique (%)< 0.1%
Missing2662
Missing (%)0.6%
Infinite0
Infinite (%)0.0%
Mean0.01826443163352163
Minimum0.0
Maximum7.0
Zeros407047
Zeros (%)97.8%
Memory size3.2 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum7
Range7
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.1580884453
Coefficient of variation (CV)8.655535988
Kurtosis192.6168951
Mean0.01826443163
Median Absolute Deviation (MAD)0
Skewness11.61566853
Sum7553
Variance0.02499195654
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
040704797.8%
 
157131.4%
 
25700.1%
 
3143< 0.1%
 
449< 0.1%
 
511< 0.1%
 
72< 0.1%
 
61< 0.1%
 
(Missing)26620.6%
 
ValueCountFrequency (%) 
040704797.8%
 
157131.4%
 
25700.1%
 
3143< 0.1%
 
449< 0.1%
 
ValueCountFrequency (%) 
72< 0.1%
 
61< 0.1%
 
511< 0.1%
 
449< 0.1%
 
3143< 0.1%
 

INJURIES_NON_INCAPACITATING
Real number (ℝ≥0)

ZEROS

Distinct count16
Unique (%)< 0.1%
Missing2662
Missing (%)0.6%
Infinite0
Infinite (%)0.0%
Mean0.09160024762052155
Minimum0.0
Maximum21.0
Zeros384446
Zeros (%)92.4%
Memory size3.2 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum21
Range21
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.3898124196
Coefficient of variation (CV)4.255582596
Kurtosis120.6937507
Mean0.09160024762
Median Absolute Deviation (MAD)0
Skewness7.336284575
Sum37880
Variance0.1519537225
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
038444692.4%
 
1232815.6%
 
239751.0%
 
311690.3%
 
44230.1%
 
5142< 0.1%
 
658< 0.1%
 
719< 0.1%
 
86< 0.1%
 
105< 0.1%
 
Other values (6)12< 0.1%
 
(Missing)26620.6%
 
ValueCountFrequency (%) 
038444692.4%
 
1232815.6%
 
239751.0%
 
311690.3%
 
44230.1%
 
ValueCountFrequency (%) 
212< 0.1%
 
181< 0.1%
 
161< 0.1%
 
141< 0.1%
 
114< 0.1%
 

INJURIES_REPORTED_NOT_EVIDENT
Real number (ℝ≥0)

ZEROS

Distinct count12
Unique (%)< 0.1%
Missing2662
Missing (%)0.6%
Infinite0
Infinite (%)0.0%
Mean0.056340923160256906
Minimum0.0
Maximum11.0
Zeros395564
Zeros (%)95.0%
Memory size3.2 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum11
Range11
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.3022171841
Coefficient of variation (CV)5.364079378
Kurtosis98.97241035
Mean0.05634092316
Median Absolute Deviation (MAD)0
Skewness7.930155486
Sum23299
Variance0.09133522636
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
039556495.0%
 
1142223.4%
 
227290.7%
 
36890.2%
 
4206< 0.1%
 
584< 0.1%
 
617< 0.1%
 
710< 0.1%
 
85< 0.1%
 
95< 0.1%
 
Other values (2)5< 0.1%
 
(Missing)26620.6%
 
ValueCountFrequency (%) 
039556495.0%
 
1142223.4%
 
227290.7%
 
36890.2%
 
4206< 0.1%
 
ValueCountFrequency (%) 
111< 0.1%
 
104< 0.1%
 
95< 0.1%
 
85< 0.1%
 
710< 0.1%
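
The injury columns can be cross-checked against each other. The sketch below assumes that INJURIES_TOTAL is the sum of the fatal, incapacitating, non-incapacitating and reported-not-evident counts; the report does not state this relationship, but the published sums are consistent with it. INJURIES_FATAL is profiled as categorical, so its sum is rebuilt here from its frequency table.

```python
# INJURIES_FATAL frequency table: value -> number of crashes.
fatal_counts = {0: 413_179, 1: 333, 2: 20, 3: 4}
fatal_sum = sum(value * count for value, count in fatal_counts.items())

# "Sum" figures copied from the numeric injury columns.
incapacitating_sum = 7_553
non_incapacitating_sum = 37_880
reported_not_evident_sum = 23_299

# Assumption: total injuries = sum of the four severity columns.
computed_total = (
    fatal_sum
    + incapacitating_sum
    + non_incapacitating_sum
    + reported_not_evident_sum
)
reported_total = 69_117  # "Sum" reported for INJURIES_TOTAL

print("fatal injuries:", fatal_sum)
print("computed total:", computed_total, "reported total:", reported_total)
```

The computed total equals the reported sum for INJURIES_TOTAL, which supports the assumed relationship for this snapshot of the data.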
 

INJURIES_NO_INDICATION
Real number (ℝ≥0)

ZEROS

Distinct count42
Unique (%)< 0.1%
Missing2662
Missing (%)0.6%
Infinite0
Infinite (%)0.0%
Mean2.017824324847172
Minimum0.0
Maximum61.0
Zeros7217
Zeros (%)1.7%
Memory size3.2 MiB

Quantile statistics

Minimum0
5-th percentile1
Q11
median2
Q32
95-th percentile4
Maximum61
Range61
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.156723458
Coefficient of variation (CV)0.5732528067
Kurtosis88.06257539
Mean2.017824325
Median Absolute Deviation (MAD)1
Skewness4.289878612
Sum834443
Variance1.338009157
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
219840047.7%
 
112297329.5%
 
35147812.4%
 
4194144.7%
 
581031.9%
 
072171.7%
 
634340.8%
 
713060.3%
 
85960.1%
 
92420.1%
 
Other values (32)3730.1%
 
(Missing)26620.6%
 
ValueCountFrequency (%) 
072171.7%
 
112297329.5%
 
219840047.7%
 
35147812.4%
 
4194144.7%
 
ValueCountFrequency (%) 
611< 0.1%
 
501< 0.1%
 
451< 0.1%
 
423< 0.1%
 
402< 0.1%
 
Distinct count1
Unique (%)< 0.1%
Missing2662
Missing (%)0.6%
Memory size3.2 MiB
0
413536
(Missing)
 
2662
ValueCountFrequency (%) 
041353699.4%
 
(Missing)26620.6%
 

CRASH_HOUR
Real number (ℝ≥0)

ZEROS

Distinct count24
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13.192122499387311
Minimum0
Maximum23
Zeros7955
Zeros (%)1.9%
Memory size3.2 MiB

Quantile statistics

Minimum0
5-th percentile3
Q19
median14
Q317
95-th percentile22
Maximum23
Range23
Interquartile range (IQR)8

Descriptive statistics

Standard deviation5.466229452
Coefficient of variation (CV)0.4143555711
Kurtosis-0.3949940867
Mean13.1921225
Median Absolute Deviation (MAD)4
Skewness-0.3972465967
Sum5490535
Variance29.87966442
Histogram with fixed size bins (bins=10)
Common values

Value               Count     Frequency (%)
16                  31862     7.7%
15                  31777     7.6%
17                  31499     7.6%
14                  28035     6.7%
18                  25945     6.2%
13                  25468     6.1%
12                  24434     5.9%
8                   23482     5.6%
11                  21028     5.1%
9                   20013     4.8%
Other values (14)   152655    36.7%

Minimum 5 values

Value               Count     Frequency (%)
0                   7955      1.9%
1                   6789      1.6%
2                   6029      1.4%
3                   4973      1.2%
4                   4621      1.1%

Maximum 5 values

Value               Count     Frequency (%)
23                  10014     2.4%
22                  12149     2.9%
21                  13130     3.2%
20                  14633     3.5%
19                  18707     4.5%
 

CRASH_DAY_OF_WEEK
Real number (ℝ≥0)

Distinct count7
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.124536879081591
Minimum1
Maximum7
Zeros0
Zeros (%)0.0%
Memory size3.2 MiB

Quantile statistics

Minimum1
5-th percentile1
Q12
median4
Q36
95-th percentile7
Maximum7
Range6
Interquartile range (IQR)4

Descriptive statistics

Standard deviation1.969188807
Coefficient of variation (CV)0.477432707
Kurtosis-1.23070302
Mean4.124536879
Median Absolute Deviation (MAD)2
Skewness-0.0721495666
Sum1716624
Variance3.877704559
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
66759216.2%
 
76095014.6%
 
36030014.5%
 
55988414.4%
 
45952414.3%
 
25805813.9%
 
14989012.0%
 
ValueCountFrequency (%) 
14989012.0%
 
25805813.9%
 
36030014.5%
 
45952414.3%
 
55988414.4%
 
ValueCountFrequency (%) 
76095014.6%
 
66759216.2%
 
55988414.4%
 
45952414.3%
 
36030014.5%
 

CRASH_MONTH
Real number (ℝ≥0)

Distinct count12
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.601134556148756
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Memory size3.2 MiB

Quantile statistics

Minimum1
5-th percentile1
Q14
median7
Q310
95-th percentile12
Maximum12
Range11
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.469570701
Coefficient of variation (CV)0.5256021782
Kurtosis-1.224863741
Mean6.601134556
Median Absolute Deviation (MAD)3
Skewness-0.03463350759
Sum2747379
Variance12.03792085
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
10380439.1%
 
6376909.1%
 
12369138.9%
 
5367178.8%
 
11358328.6%
 
9349518.4%
 
1343728.3%
 
3336518.1%
 
2329177.9%
 
8326467.8%
 
Other values (2)6246615.0%
 
ValueCountFrequency (%) 
1343728.3%
 
2329177.9%
 
3336518.1%
 
4313447.5%
 
5367178.8%
 
ValueCountFrequency (%) 
12369138.9%
 
11358328.6%
 
10380439.1%
 
9349518.4%
 
8326467.8%
 

LATITUDE
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED

Distinct count188507
Unique (%)45.5%
Missing2269
Missing (%)0.5%
Infinite0
Infinite (%)0.0%
Mean41.85715475635502
Minimum0.0
Maximum42.022779861
Zeros29
Zeros (%)< 0.1%
Memory size3.2 MiB

Quantile statistics

Minimum0
5-th percentile41.71443216
Q141.78658293
median41.877441
Q341.92475429
95-th percentile41.9903545
Maximum42.02277986
Range42.02277986
Interquartile range (IQR)0.13817136

Descriptive statistics

Standard deviation0.3605778004
Coefficient of variation (CV)0.008614484251
Kurtosis12719.21078
Mean41.85715476
Median Absolute Deviation (MAD)0.06766827
Skewness-109.5963231
Sum17325890.21
Variance0.1300163502
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
41.976201145530.1%
 
41.791420282740.1%
 
41.75146062700.1%
 
41.722257272230.1%
 
41.78932932204< 0.1%
 
41.75466012194< 0.1%
 
41.90095892177< 0.1%
 
41.74257762151< 0.1%
 
41.73638005147< 0.1%
 
41.79291088141< 0.1%
 
Other values (188497)41159598.9%
 
(Missing)22690.5%
 
Minimum 5 values

Value        Count  Frequency (%)
0            29     < 0.1%
41.64467013  11     < 0.1%
41.64469152  2      < 0.1%
41.64469397  5      < 0.1%
41.64470194  1      < 0.1%

Maximum 5 values

Value        Count  Frequency (%)
42.02277986  3      < 0.1%
42.02273632  1      < 0.1%
42.02272017  1      < 0.1%
42.02266893  1      < 0.1%
42.02266114  2      < 0.1%
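A skewness of -109.6 next to a median of 41.88 and 29 zeros suggests that a few far-away values, rather than the bulk of the data, drive the statistic. The sketch below illustrates that effect on made-up numbers: a single 0.0 among tightly clustered latitudes produces a strongly negative skewness. `skewness` is a hypothetical helper using the population (divide-by-n) moment formula, which may differ slightly from the estimator used by the report.

```python
def skewness(xs):
    """Population skewness: third central moment divided by std**3."""
    n = len(xs)
    mean = sum(xs) / n
    m2 = sum((x - mean) ** 2 for x in xs) / n   # second central moment
    m3 = sum((x - mean) ** 3 for x in xs) / n   # third central moment
    return m3 / m2 ** 1.5

# Hypothetical sample: 300 evenly spaced latitudes plus one zero.
lats = [41.70 + 0.001 * i for i in range(300)] + [0.0]

with_zero = skewness(lats)
without_zero = skewness([x for x in lats if x != 0.0])
print(f"with zero: {with_zero:.2f}, without zero: {without_zero:.2f}")
```

Whether the 29 zeros in the real data are placeholders for unknown positions is an assumption that the report itself does not confirm.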
 

LONGITUDE
Real number (ℝ)

HIGH CORRELATION
SKEWED

Distinct count: 188492
Unique (%): 45.5%
Missing: 2269
Missing (%): 0.5%
Infinite: 0
Infinite (%): 0.0%
Mean: -87.6721175590124
Minimum: -87.934014222
Maximum: 0.0
Zeros: 29
Zeros (%): < 0.1%
Memory size: 3.2 MiB

Quantile statistics

Minimum: -87.93401422
5th percentile: -87.77636157
Q1: -87.72079453
Median: -87.67297631
Q3: -87.63296444
95th percentile: -87.58608853
Maximum: 0
Range: 87.93401422
Interquartile range (IQR): 0.087830094

Descriptive statistics

Standard deviation: 0.7361749743
Coefficient of variation (CV): -0.008396911069
Kurtosis: 14089.92941
Mean: -87.67211756
Median Absolute Deviation (MAD): 0.042428613
Skewness: 118.3356283
Sum: -36290031.95
Variance: 0.5419535928
Histogram with fixed size bins (bins=10)
Common values

Value                  Count    Frequency (%)
-87.90530913           553      0.1%
-87.58014777           274      0.1%
-87.58597199           270      0.1%
-87.58527557           223      0.1%
-87.74164564           204      < 0.1%
-87.74138476           194      < 0.1%
-87.61992817           177      < 0.1%
-87.63393693           151      < 0.1%
-87.62750922           147      < 0.1%
-87.74207734           141      < 0.1%
Other values (188482)  411595   98.9%
(Missing)              2269     0.5%
 
Minimum 5 values

Value         Count  Frequency (%)
-87.93401422  1      < 0.1%
-87.93399393  15     < 0.1%
-87.9339765   1      < 0.1%
-87.93302828  2      < 0.1%
-87.92726168  3      < 0.1%

Maximum 5 values

Value         Count  Frequency (%)
0             29     < 0.1%
-87.52458739  3      < 0.1%
-87.52458901  2      < 0.1%
-87.52464032  1      < 0.1%
-87.52467395  4      < 0.1%
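LATITUDE and LONGITUDE each report 29 zeros, and those zeros alone stretch the LONGITUDE range to 87.93 while the IQR is under 0.09. A minimal sketch of treating such pairs as missing before further analysis follows. It assumes that (0, 0) is a placeholder rather than a real position, which the report does not confirm, and `mask_zero_coordinates` is a hypothetical helper.

```python
def mask_zero_coordinates(pairs):
    """Replace (0.0, 0.0) latitude/longitude pairs with (None, None)."""
    return [(None, None) if lat == 0.0 and lon == 0.0 else (lat, lon)
            for lat, lon in pairs]

# Two (latitude, longitude) pairs from the sample rows of this report,
# plus one pair that mimics the zero rows.
pairs = [(41.958987, -87.933994), (41.775563, -87.576444), (0.0, 0.0)]
cleaned = mask_zero_coordinates(pairs)

# Range of the remaining longitudes once the placeholder is masked.
lons = [lon for _, lon in cleaned if lon is not None]
lon_range = max(lons) - min(lons)
print(cleaned[-1], round(lon_range, 6))
```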
 

LOCATION
Categorical

HIGH CARDINALITY

Distinct count: 188585
Unique (%): 45.6%
Missing: 2269
Missing (%): 0.5%
Memory size: 3.2 MiB

Value                                     Count
POINT (-87.905309125103 41.976201139024)  553
POINT (-87.580147768689 41.791420282098)  274
POINT (-87.585971992965 41.751460603167)  270
POINT (-87.585275565077 41.722257273006)  223
POINT (-87.741645644196 41.789329323265)  204
Other values (188580)                     412405
Common values

Value                                     Count    Frequency (%)
POINT (-87.905309125103 41.976201139024)  553      0.1%
POINT (-87.580147768689 41.791420282098)  274      0.1%
POINT (-87.585971992965 41.751460603167)  270      0.1%
POINT (-87.585275565077 41.722257273006)  223      0.1%
POINT (-87.741645644196 41.789329323265)  204      < 0.1%
POINT (-87.741384758605 41.754660124394)  194      < 0.1%
POINT (-87.619928173678 41.900958919109)  177      < 0.1%
POINT (-87.633936930688 41.742577617335)  151      < 0.1%
POINT (-87.627509219026 41.736380045588)  147      < 0.1%
POINT (-87.742077342959 41.792910883497)  141      < 0.1%
Other values (188575)                     411595   98.9%
(Missing)                                 2269     0.5%
 

Length

Max length: 40
Median length: 40
Mean length: 39.57757846
Min length: 3
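The LOCATION values are text of the form `POINT (longitude latitude)`, with the longitude first. A minimal sketch of turning such a string back into two floats is shown below. `parse_point` is a hypothetical helper, and the guess that the minimum length of 3 comes from missing values rendered as the text "nan" is an assumption.

```python
import re

# Matches "POINT (<lon> <lat>)" with optional sign and decimals.
_POINT = re.compile(r"^POINT \((-?\d+(?:\.\d+)?) (-?\d+(?:\.\d+)?)\)$")

def parse_point(text):
    """Return (longitude, latitude) as floats, or None if `text` is not a POINT string."""
    if not isinstance(text, str):
        return None          # e.g. a float NaN for a missing value
    match = _POINT.match(text.strip())
    if match is None:
        return None          # e.g. the string "nan"
    return float(match.group(1)), float(match.group(2))

sample = "POINT (-87.905309125103 41.976201139024)"
print(len(sample), parse_point(sample))
print(parse_point("nan"))
```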

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, so for a linear relationship the steepness of the line (its angle to the x-axis) does not affect r; only the direction of the slope determines the sign.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
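The calculation described above can be sketched in plain Python. `pearson_r` is a hypothetical helper. It uses population (divide-by-n) moments, and the choice of n or n-1 cancels in the ratio.

```python
import math

def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / n
    std_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs) / n)
    std_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys) / n)
    return cov / (std_x * std_y)

xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 3.0, 5.0, 4.0]

r = pearson_r(xs, ys)
# Shifting and rescaling one variable (with a positive factor) leaves r unchanged.
r_rescaled = pearson_r(xs, [10.0 * y + 7.0 for y in ys])
print(r, r_rescaled)
```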

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of the standard deviations of those rank variables.
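In other words, ρ is Pearson's r applied to ranks. The sketch below follows that recipe. `average_ranks` and `spearman_rho` are hypothetical helpers, and giving tied values the average of their ranks is a common convention assumed here, not something the report states.

```python
import math

def average_ranks(values):
    """1-based ranks; tied values share the average of the ranks they span."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def spearman_rho(xs, ys):
    """Pearson correlation of the rank variables of xs and ys."""
    rx, ry = average_ranks(xs), average_ranks(ys)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry)) / n
    sx = math.sqrt(sum((a - mx) ** 2 for a in rx) / n)
    sy = math.sqrt(sum((b - my) ** 2 for b in ry) / n)
    return cov / (sx * sy)

# A nonlinear but strictly increasing relationship.
xs = [1, 2, 3, 4, 5]
ys = [x ** 3 for x in xs]
print(spearman_rho(xs, ys))
```

Because only the ordering matters, the cubic relationship in the example is treated the same as a straight line.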

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation.

To calculate τ for two variables X and Y, one counts the concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
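The pair-counting definition above translates directly into code. The sketch below implements that simple form (often called tau-a), in which tied pairs count as neither concordant nor discordant. Statistical libraries commonly apply a tie correction (tau-b), so results can differ when ties are present. `kendall_tau` is a hypothetical helper.

```python
from itertools import combinations

def kendall_tau(xs, ys):
    """(concordant - discordant) / total number of pairs, without tie correction."""
    concordant = discordant = pairs = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        pairs += 1
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1   # both variables move in the same direction
        elif direction < 0:
            discordant += 1   # the variables move in opposite directions
    return (concordant - discordant) / pairs

# Three observations: two concordant pairs, one discordant pair.
print(kendall_tau([1, 2, 3], [1, 3, 2]))
```

This version compares every pair and is therefore quadratic in the number of observations, which is fine for illustration but slow for a dataset of this size.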

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available for the phik package.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
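A minimal sketch of a bias-corrected Cramér's V is given below, using the correction commonly attributed to Bergsma (2013): the chi-squared based φ² and the table dimensions are each adjusted by a term in 1/(n-1). `cramers_v_corrected` is a hypothetical helper, and whether it matches the report generator's implementation line for line is an assumption.

```python
import math
from collections import Counter

def cramers_v_corrected(xs, ys):
    """Bias-corrected Cramér's V for two equally long sequences of category labels."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    r, k = len(row_totals), len(col_totals)

    # Pearson chi-squared statistic of the contingency table.
    chi2 = 0.0
    for a, row_n in row_totals.items():
        for b, col_n in col_totals.items():
            expected = row_n * col_n / n
            observed = joint.get((a, b), 0)
            chi2 += (observed - expected) ** 2 / expected

    phi2 = chi2 / n
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    denom = min(k_corr - 1, r_corr - 1)
    if denom <= 0:
        return 0.0   # a variable with a single category carries no association
    return math.sqrt(phi2_corr / denom)

# Hypothetical labels: perfectly associated, then independent.
perfect = cramers_v_corrected(["a"] * 50 + ["b"] * 50, ["x"] * 50 + ["y"] * 50)
independent = cramers_v_corrected(["a", "a", "b", "b"] * 25, ["x", "y"] * 50)
print(perfect, independent)
```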

Missing values

Sample

First rows

(index) | CRASH_RECORD_ID | RD_NO | CRASH_DATE_EST_I | CRASH_DATE | POSTED_SPEED_LIMIT | TRAFFIC_CONTROL_DEVICE | DEVICE_CONDITION | WEATHER_CONDITION | LIGHTING_CONDITION | FIRST_CRASH_TYPE | TRAFFICWAY_TYPE | LANE_CNT | ALIGNMENT | ROADWAY_SURFACE_COND | ROAD_DEFECT | REPORT_TYPE | CRASH_TYPE | INTERSECTION_RELATED_I | NOT_RIGHT_OF_WAY_I | HIT_AND_RUN_I | DAMAGE | DATE_POLICE_NOTIFIED | PRIM_CONTRIBUTORY_CAUSE | SEC_CONTRIBUTORY_CAUSE | STREET_NO | STREET_DIRECTION | STREET_NAME | BEAT_OF_OCCURRENCE | PHOTOS_TAKEN_I | STATEMENTS_TAKEN_I | DOORING_I | WORK_ZONE_I | WORK_ZONE_TYPE | WORKERS_PRESENT_I | NUM_UNITS | MOST_SEVERE_INJURY | INJURIES_TOTAL | INJURIES_FATAL | INJURIES_INCAPACITATING | INJURIES_NON_INCAPACITATING | INJURIES_REPORTED_NOT_EVIDENT | INJURIES_NO_INDICATION | INJURIES_UNKNOWN | CRASH_HOUR | CRASH_DAY_OF_WEEK | CRASH_MONTH | LATITUDE | LONGITUDE | LOCATION
0 | 073682ef84ff827659552d4254ad1b98bfec24935cc9cc4cbb796b0d17f98ce6028f4ac1795c51cdfa6cb6933de709f33cfe5171c175e4a0f5aebda62f2f61ce | JB460108 | NaN | 10/02/2018 06:30:00 PM | 10 | NO CONTROLS | NO CONTROLS | CLEAR | DARKNESS | PARKED MOTOR VEHICLE | OTHER | NaN | STRAIGHT AND LEVEL | DRY | NO DEFECTS | ON SCENE | NO INJURY / DRIVE AWAY | NaN | NaN | NaN | OVER $1,500 | 10/02/2018 07:35:00 PM | NOT APPLICABLE | NOT APPLICABLE | 517 | W | OHARE ST | 1654.0 | NaN | NaN | NaN | NaN | NaN | NaN | 2.0 | NO INDICATION OF INJURY | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 18 | 3 | 10 | NaN | NaN | NaN
1 | 1560fb8a1e32b528fef8bfd677d2b3fc5ab37278b157fa5ecf66064359bdad3803b9d298d234eb31f463d08515c791b5627fbf7afea3765d1c1a9f3befc5bdcc | JC325941 | NaN | 06/27/2019 04:00:00 PM | 45 | NO CONTROLS | NO CONTROLS | CLEAR | DAYLIGHT | SIDESWIPE SAME DIRECTION | ONE-WAY | NaN | STRAIGHT AND LEVEL | DRY | NO DEFECTS | ON SCENE | NO INJURY / DRIVE AWAY | NaN | NaN | NaN | OVER $1,500 | 06/27/2019 04:00:00 PM | UNABLE TO DETERMINE | UNABLE TO DETERMINE | 3 | W | TERMINAL ST | 1653.0 | NaN | NaN | NaN | NaN | NaN | NaN | 2.0 | NO INDICATION OF INJURY | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 2.0 | 0.0 | 16 | 5 | 6 | NaN | NaN | NaN
2 | 009e9e67203442370272e1a13d6ee51a4155dac65e583d1bdbee1fde686de7508c14ab5f205402f72644001276718917e02985561dc71b7d4bf945f09d7d47f5 | JA329216 | NaN | 06/30/2017 04:00:00 PM | 35 | STOP SIGN/FLASHER | FUNCTIONING PROPERLY | CLEAR | DAYLIGHT | TURNING | NOT DIVIDED | 4.0 | STRAIGHT AND LEVEL | DRY | NO DEFECTS | ON SCENE | INJURY AND / OR TOW DUE TO CRASH | Y | NaN | NaN | OVER $1,500 | 06/30/2017 04:01:00 PM | FAILING TO YIELD RIGHT-OF-WAY | NOT APPLICABLE | 8301 | S | CICERO AVE | 834.0 | NaN | NaN | NaN | NaN | NaN | NaN | 2.0 | NO INDICATION OF INJURY | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 3.0 | 0.0 | 16 | 6 | 6 | 41.741804 | -87.740954 | POINT (-87.740953581987 41.741803598989)
3 | 00e47f189660cd8ba1e85fc63061bf1d8465184393f134fb8251ed7896a4ba9ed7c984ab51a01f564d6f4133c6ef8493b1a369743a4a308d4392900a286e160f | JC194776 | NaN | 03/21/2019 10:50:00 PM | 30 | TRAFFIC SIGNAL | FUNCTIONING PROPERLY | CLEAR | DARKNESS, LIGHTED ROAD | TURNING | NOT DIVIDED | 4.0 | STRAIGHT AND LEVEL | DRY | NO DEFECTS | ON SCENE | NO INJURY / DRIVE AWAY | Y | NaN | NaN | OVER $1,500 | 03/21/2019 10:52:00 PM | UNABLE TO DETERMINE | UNABLE TO DETERMINE | 8301 | S | CICERO AVE | 834.0 | NaN | Y | NaN | NaN | NaN | NaN | 2.0 | NO INDICATION OF INJURY | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 2.0 | 0.0 | 22 | 5 | 3 | 41.741804 | -87.740954 | POINT (-87.740953581987 41.741803598989)
4 | 0126747fc9ffc0edc9a38abb83d80034f897db0f739eef57f9bc75de8f2702a4c8f6dd8e49f5c2e810e1ec428bd9532fd0e6c583ca72669da9e65fc2a0a6de12 | JB200478 | NaN | 03/26/2018 02:23:00 PM | 35 | NO CONTROLS | NO CONTROLS | CLEAR | DAYLIGHT | PARKED MOTOR VEHICLE | NOT DIVIDED | NaN | STRAIGHT AND LEVEL | DRY | NO DEFECTS | NOT ON SCENE (DESK REPORT) | NO INJURY / DRIVE AWAY | NaN | NaN | NaN | $501 - $1,500 | 03/26/2018 03:20:00 PM | UNABLE TO DETERMINE | UNABLE TO DETERMINE | 3999 | N | AVONDALE AVE | 1732.0 | NaN | NaN | NaN | NaN | NaN | NaN | 2.0 | NO INDICATION OF INJURY | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 2.0 | 0.0 | 14 | 2 | 3 | 41.953647 | -87.732082 | POINT (-87.732081736006 41.953646899951)
5 | 5d672ce84d5b78346be822b388604bdf9cb3fa348a5adc89501859f857e1e9308e35ececb0a56527c6e6d065cc85e4e93302c1c57a97068034116e8b23ec2f4e | JD158927 | NaN | 02/20/2020 04:45:00 PM | 35 | TRAFFIC SIGNAL | OTHER | CLEAR | DAWN | REAR END | T-INTERSECTION | NaN | STRAIGHT AND LEVEL | DRY | NO DEFECTS | ON SCENE | NO INJURY / DRIVE AWAY | Y | NaN | NaN | $501 - $1,500 | 02/20/2020 04:51:00 PM | NOT APPLICABLE | NOT APPLICABLE | 12300 | W | IRVING PARK RD | 1654.0 | NaN | NaN | NaN | NaN | NaN | NaN | 2.0 | NO INDICATION OF INJURY | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 2.0 | 0.0 | 16 | 5 | 2 | 41.958987 | -87.933994 | POINT (-87.933993928974 41.958986950953)
6 | 0209e21f298984f7375742b7ef27c9880b485f41123a12b5a8eb14f01171abbbc05974399b985e05a352f801869cae0f41587f39a51994338fb82aeba853eece | JB415436 | NaN | 08/30/2018 05:45:00 PM | 30 | TRAFFIC SIGNAL | FUNCTIONING PROPERLY | CLEAR | DAYLIGHT | TURNING | NOT DIVIDED | NaN | STRAIGHT AND LEVEL | DRY | UNKNOWN | NOT ON SCENE (DESK REPORT) | NO INJURY / DRIVE AWAY | Y | NaN | NaN | OVER $1,500 | 08/30/2018 05:58:00 PM | IMPROPER OVERTAKING/PASSING | IMPROPER LANE USAGE | 600 | W | DIVISION ST | 1822.0 | NaN | NaN | NaN | NaN | NaN | NaN | 2.0 | NO INDICATION OF INJURY | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 2.0 | 0.0 | 17 | 5 | 8 | 41.903825 | -87.643286 | POINT (-87.643286359995 41.903825233976)
7 | 0211e1f766f3940dfa87375661d25b716655e908c320cc46910e8fa5fb1f1e6a9d4f714d21e8e401ec9e0a12190b6cd9f6dbc97d32d0c0fc966a02ae516e782f | JC301403 | NaN | 06/11/2019 08:40:00 AM | 30 | TRAFFIC SIGNAL | FUNCTIONING PROPERLY | CLEAR | DAYLIGHT | REAR END | DIVIDED - W/MEDIAN BARRIER | NaN | STRAIGHT AND LEVEL | DRY | NO DEFECTS | NOT ON SCENE (DESK REPORT) | NO INJURY / DRIVE AWAY | Y | NaN | NaN | $501 - $1,500 | 06/11/2019 09:05:00 AM | UNABLE TO DETERMINE | NOT APPLICABLE | 50 | E | GARFIELD BLVD | 225.0 | NaN | NaN | NaN | NaN | NaN | NaN | 2.0 | NO INDICATION OF INJURY | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 3.0 | 0.0 | 8 | 3 | 6 | 41.794779 | -87.623828 | POINT (-87.623828038036 41.794778764028)
8 | 02e2ed3606a50dda185f5e97c57a45552087d6fbea1c4b5f3777e0503da72279211f0518aabbeca2cd8e8ee1aca6cae3f88a0531cb62bb39ac156ca3d55e0931 | JB256393 | NaN | 05/09/2018 11:30:00 AM | 25 | NO CONTROLS | NO CONTROLS | RAIN | DAYLIGHT | ANGLE | NOT DIVIDED | 2.0 | STRAIGHT AND LEVEL | WET | NO DEFECTS | ON SCENE | NO INJURY / DRIVE AWAY | NaN | NaN | NaN | OVER $1,500 | 05/09/2018 11:35:00 AM | FAILING TO YIELD RIGHT-OF-WAY | UNABLE TO DETERMINE | 9511 | S | WENTWORTH AVE | 511.0 | NaN | NaN | NaN | NaN | NaN | NaN | 2.0 | NO INDICATION OF INJURY | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 2.0 | 0.0 | 11 | 4 | 5 | 41.721290 | -87.628510 | POINT (-87.628509593966 41.72128957001)
9 | 03c8fee8a0cb0d303e972a873228b444a47b7b1ed1e2d97a8c409203dc81a9d97abc26692d325af7428a2f8f880ab0e551e763782226b9b6a0c3e19abd7ffa23 | JB317419 | NaN | 06/22/2018 07:25:00 AM | 35 | TRAFFIC SIGNAL | FUNCTIONING PROPERLY | RAIN | DAYLIGHT | TURNING | NOT DIVIDED | 6.0 | STRAIGHT AND LEVEL | WET | NO DEFECTS | ON SCENE | INJURY AND / OR TOW DUE TO CRASH | Y | NaN | NaN | OVER $1,500 | 06/22/2018 07:27:00 AM | UNABLE TO DETERMINE | UNABLE TO DETERMINE | 8301 | S | CICERO AVE | 834.0 | NaN | NaN | NaN | NaN | NaN | NaN | 2.0 | NONINCAPACITATING INJURY | 2.0 | 0.0 | 0.0 | 2.0 | 0.0 | 2.0 | 0.0 | 7 | 6 | 6 | 41.741804 | -87.740954 | POINT (-87.740953581987 41.741803598989)
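In the sample rows, CRASH_HOUR, CRASH_DAY_OF_WEEK and CRASH_MONTH are consistent with CRASH_DATE if the day of week is numbered from Sunday = 1. That numbering is inferred from the rows shown here, not documented in the report. A minimal sketch of deriving the three fields from the date text, with `derive_time_fields` as a hypothetical helper:

```python
from datetime import datetime

def derive_time_fields(crash_date):
    """Return (hour, day_of_week, month) from a 'MM/DD/YYYY HH:MM:SS AM/PM' string.

    day_of_week uses Sunday = 1 ... Saturday = 7 (assumed convention).
    """
    ts = datetime.strptime(crash_date, "%m/%d/%Y %I:%M:%S %p")
    day_of_week = ts.isoweekday() % 7 + 1   # isoweekday: Monday = 1 ... Sunday = 7
    return ts.hour, day_of_week, ts.month

# CRASH_DATE values taken from the sample rows of this report.
print(derive_time_fields("10/02/2018 06:30:00 PM"))
print(derive_time_fields("05/16/2020 11:45:00 AM"))
```

Parsing the AM/PM marker with `%p` depends on the active locale. The sketch assumes the default C/English locale.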

Last rows

(index) | CRASH_RECORD_ID | RD_NO | CRASH_DATE_EST_I | CRASH_DATE | POSTED_SPEED_LIMIT | TRAFFIC_CONTROL_DEVICE | DEVICE_CONDITION | WEATHER_CONDITION | LIGHTING_CONDITION | FIRST_CRASH_TYPE | TRAFFICWAY_TYPE | LANE_CNT | ALIGNMENT | ROADWAY_SURFACE_COND | ROAD_DEFECT | REPORT_TYPE | CRASH_TYPE | INTERSECTION_RELATED_I | NOT_RIGHT_OF_WAY_I | HIT_AND_RUN_I | DAMAGE | DATE_POLICE_NOTIFIED | PRIM_CONTRIBUTORY_CAUSE | SEC_CONTRIBUTORY_CAUSE | STREET_NO | STREET_DIRECTION | STREET_NAME | BEAT_OF_OCCURRENCE | PHOTOS_TAKEN_I | STATEMENTS_TAKEN_I | DOORING_I | WORK_ZONE_I | WORK_ZONE_TYPE | WORKERS_PRESENT_I | NUM_UNITS | MOST_SEVERE_INJURY | INJURIES_TOTAL | INJURIES_FATAL | INJURIES_INCAPACITATING | INJURIES_NON_INCAPACITATING | INJURIES_REPORTED_NOT_EVIDENT | INJURIES_NO_INDICATION | INJURIES_UNKNOWN | CRASH_HOUR | CRASH_DAY_OF_WEEK | CRASH_MONTH | LATITUDE | LONGITUDE | LOCATION
416188 | e03b02d64d5d4db07c0c24fe1e9c495b4ac5b67cbe1e7334d16533b47b2a410bf1481830ec85ab046c4b391bf689476f68af6df1437f9e21cd10cacb10f7308a | JD236641 | NaN | 05/19/2020 01:00:00 PM | 25 | NO CONTROLS | NO CONTROLS | CLEAR | DAYLIGHT | REAR END | ONE-WAY | NaN | STRAIGHT AND LEVEL | WET | NO DEFECTS | NOT ON SCENE (DESK REPORT) | NO INJURY / DRIVE AWAY | NaN | NaN | Y | $501 - $1,500 | 05/19/2020 02:30:00 PM | NOT APPLICABLE | NOT APPLICABLE | 1000 | S | KOLMAR AVE | 1131.0 | NaN | NaN | NaN | NaN | NaN | NaN | 2.0 | NO INDICATION OF INJURY | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 2.0 | 0.0 | 13 | 3 | 5 | 41.869118 | -87.739401 | POINT (-87.739401314183 41.869117777281)
416189 | f198c65371434926c14b2a63ed8792a1389fadaf675d8ee368dc5aaf1a30467a894c1412b2f099985c04308755ad76dfd1589de86c674ff6050c527539baee0d | JD236802 | NaN | 05/19/2020 03:31:00 PM | 30 | TRAFFIC SIGNAL | FUNCTIONING PROPERLY | CLEAR | DAYLIGHT | TURNING | FOUR WAY | NaN | STRAIGHT AND LEVEL | DRY | NO DEFECTS | ON SCENE | NO INJURY / DRIVE AWAY | Y | NaN | NaN | $501 - $1,500 | 05/19/2020 03:51:00 PM | IMPROPER TURNING/NO SIGNAL | NOT APPLICABLE | 2001 | E | MARQUETTE DR | 331.0 | NaN | NaN | NaN | Y | CONSTRUCTION | N | 2.0 | NO INDICATION OF INJURY | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 2.0 | 0.0 | 15 | 3 | 5 | 41.775563 | -87.576444 | POINT (-87.576444451178 41.775562727526)
416190 | f52f00333e5cac1530939b110da25a72de89372c032ae422467081eb937a8ff841ff2f5d73c28ededd41b765fad1e5681c1b039b5c02435bb9edb9ee97c94eab | JD236940 | NaN | 05/19/2020 06:21:00 PM | 30 | NO CONTROLS | NO CONTROLS | CLOUDY/OVERCAST | DAYLIGHT | SIDESWIPE SAME DIRECTION | DIVIDED - W/MEDIAN (NOT RAISED) | NaN | STRAIGHT AND LEVEL | DRY | NO DEFECTS | ON SCENE | NO INJURY / DRIVE AWAY | NaN | NaN | NaN | $500 OR LESS | 05/19/2020 06:21:00 PM | FAILING TO YIELD RIGHT-OF-WAY | NOT APPLICABLE | 2473 | N | CLARK ST | 1935.0 | NaN | NaN | NaN | NaN | NaN | NaN | 2.0 | NO INDICATION OF INJURY | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 3.0 | 0.0 | 18 | 3 | 5 | 41.927478 | -87.641450 | POINT (-87.641449775903 41.927477726852)
416191 | d97477e587b8eecbdb0dbce0a0c5462e7335a4f7da2e04c8e4799193cba69f0b887de56a831e8899031c1f7175e82a2ad8c43ba34260181a3e4e1bafda37ea88 | JD234027 | NaN | 05/16/2020 11:45:00 AM | 40 | NO CONTROLS | NO CONTROLS | CLEAR | DAYLIGHT | SIDESWIPE SAME DIRECTION | DIVIDED - W/MEDIAN BARRIER | NaN | CURVE, LEVEL | DRY | NO DEFECTS | ON SCENE | INJURY AND / OR TOW DUE TO CRASH | NaN | NaN | NaN | OVER $1,500 | 05/16/2020 11:55:00 AM | FAILING TO YIELD RIGHT-OF-WAY | NOT APPLICABLE | 120 | N | LAKE SHORE DR SB | 114.0 | NaN | NaN | NaN | NaN | NaN | NaN | 2.0 | NO INDICATION OF INJURY | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 2.0 | 0.0 | 11 | 7 | 5 | 41.883642 | -87.615441 | POINT (-87.615441182232 41.883641778749)
416192 | e43ffefbcb0c4518869aac0447be15af688b10f56d652adb93a204fc18d823594c6cf5e64bcf12134ee2e91a692e0434118a1367b5a5904e448f4cdbc850bfb5 | JD234946 | NaN | 05/17/2020 01:22:00 PM | 30 | NO CONTROLS | NO CONTROLS | RAIN | DAYLIGHT | PARKED MOTOR VEHICLE | DIVIDED - W/MEDIAN BARRIER | NaN | STRAIGHT AND LEVEL | WET | UNKNOWN | ON SCENE | INJURY AND / OR TOW DUE TO CRASH | NaN | NaN | NaN | $501 - $1,500 | 05/17/2020 01:22:00 PM | IMPROPER LANE USAGE | FAILING TO REDUCE SPEED TO AVOID CRASH | 1924 | W | GARFIELD BLVD | 932.0 | NaN | NaN | NaN | NaN | NaN | NaN | 3.0 | NO INDICATION OF INJURY | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 13 | 1 | 5 | 41.794040 | -87.673019 | POINT (-87.673018619292 41.79403960152)
416193 | de4cbde297443228427759c6d58f40254dc8d8fcb201dc731da85d4d0dcd05fd2fbb10974bc0d17cdecd4128959af5b2fc2a4b8a0ee42f3370be201aba68a5bd | JD235651 | NaN | 05/18/2020 12:20:00 PM | 10 | NO CONTROLS | NO CONTROLS | CLEAR | DAYLIGHT | REAR TO SIDE | PARKING LOT | NaN | STRAIGHT AND LEVEL | DRY | NO DEFECTS | NOT ON SCENE (DESK REPORT) | NO INJURY / DRIVE AWAY | NaN | NaN | NaN | $500 OR LESS | 05/18/2020 12:32:00 PM | FAILING TO YIELD RIGHT-OF-WAY | UNABLE TO DETERMINE | 1527 | W | LAWRENCE AVE | 1912.0 | NaN | NaN | NaN | NaN | NaN | NaN | 2.0 | NO INDICATION OF INJURY | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 2.0 | 0.0 | 12 | 2 | 5 | 41.968773 | -87.668624 | POINT (-87.668624408172 41.968772794281)
416194 | eb65bb1c2f912fbd1a80406813bbe831df2083d483334697387d443e911400cfad731b727f3ab2ab982ab9583e12ef1bba31838e50a2d4c3979a28223b8ef00f | JD235725 | NaN | 05/18/2020 09:00:00 AM | 30 | UNKNOWN | UNKNOWN | UNKNOWN | UNKNOWN | PARKED MOTOR VEHICLE | UNKNOWN | NaN | STRAIGHT AND LEVEL | UNKNOWN | UNKNOWN | NOT ON SCENE (DESK REPORT) | INJURY AND / OR TOW DUE TO CRASH | NaN | NaN | Y | OVER $1,500 | 05/18/2020 01:48:00 PM | UNABLE TO DETERMINE | NOT APPLICABLE | 4015 | N | PARKSIDE AVE | 1624.0 | NaN | NaN | NaN | NaN | NaN | NaN | 2.0 | NO INDICATION OF INJURY | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 9 | 2 | 5 | 41.953658 | -87.768095 | POINT (-87.768095363598 41.953657967626)
416195 | d35263055740651327a467ea3b5181fde21e1edad1e917c4096a7ca360f3ef47a2d3dc69666c6fc9bdfe7a9fdf8420654f251fcdf7c657bddc51a3a263e6eaf5 | JD236504 | NaN | 05/19/2020 02:00:00 AM | 20 | NO CONTROLS | NO CONTROLS | RAIN | DARKNESS, LIGHTED ROAD | PARKED MOTOR VEHICLE | ONE-WAY | NaN | STRAIGHT AND LEVEL | WET | NO DEFECTS | NOT ON SCENE (DESK REPORT) | NO INJURY / DRIVE AWAY | NaN | NaN | NaN | $500 OR LESS | 05/19/2020 11:00:00 AM | UNABLE TO DETERMINE | UNABLE TO DETERMINE | 5700 | N | RICHMOND ST | 2011.0 | NaN | NaN | NaN | NaN | NaN | NaN | 2.0 | NO INDICATION OF INJURY | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 2 | 3 | 5 | 41.985008 | -87.703007 | POINT (-87.70300710512 41.985008248097)
416196 | d613f48076a4221ebaebf9fc31bf396baaf18892529f3f97474e6795d01eb9545440ed9dc0d31201c554798cce001416f0aff7f4214f3848bdb211d8f89915af | JD236487 | NaN | 05/19/2020 09:40:00 AM | 30 | NO CONTROLS | NO CONTROLS | CLEAR | DAYLIGHT | PARKED MOTOR VEHICLE | NOT DIVIDED | NaN | STRAIGHT AND LEVEL | DRY | NO DEFECTS | NOT ON SCENE (DESK REPORT) | NO INJURY / DRIVE AWAY | NaN | NaN | NaN | $501 - $1,500 | 05/19/2020 10:36:00 AM | UNABLE TO DETERMINE | UNABLE TO DETERMINE | 7715 | S | COTTAGE GROVE AVE | 624.0 | NaN | NaN | NaN | NaN | NaN | NaN | 2.0 | NO INDICATION OF INJURY | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 9 | 3 | 5 | 41.754447 | -87.605136 | POINT (-87.605136112415 41.75444668619)
416197 | f247683bbffde5cd7d1bddad30ff42c2d3d4594ffd218dc9bdbb270b88edac4d3e2c87a077ea21bede21d27eb2d508c8c9e0a42595a81d1221035e85aeea04cd | JD236796 | NaN | 05/19/2020 01:32:00 PM | 10 | NO CONTROLS | NO CONTROLS | CLEAR | DAYLIGHT | OTHER OBJECT | ALLEY | NaN | STRAIGHT AND LEVEL | DRY | NO DEFECTS | NOT ON SCENE (DESK REPORT) | NO INJURY / DRIVE AWAY | NaN | NaN | NaN | OVER $1,500 | 05/19/2020 03:46:00 PM | UNABLE TO DETERMINE | UNABLE TO DETERMINE | 925 | S | INDEPENDENCE BLVD | 1133.0 | NaN | NaN | NaN | NaN | NaN | NaN | 1.0 | NO INDICATION OF INJURY | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 13 | 3 | 5 | 41.869263 | -87.719477 | POINT (-87.719476770345 41.869263306188)